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ABSTRACT. We derive conditions under which random sequences of polariza- 
tions (two-point symmetrizations) converge almost surely to the symmetric de- 
creasing rearrangement. The parameters for the polarizations are independent 
random variables whose distributions need not be uniform. The proof of conver- 
gence hinges on an estimate for the expected distance from the limit that yields a 
bound on the rate of convergence. In the special case of i.i.d. sequences, almost 
sure convergence holds even for polarizations chosen at random from suitable 
small sets. As corollaries, we find bounds on the rate of convergence of Steiner 
symmetrizations that require no convexity assumptions, and show that full rota- 
tional symmetry can be achieved by randomly alternating Steiner symmetriza- 
tions in a finite number of directions that satisfy an explicit non-degeneracy con- 
dition. We also present some negative results on the rate of convergence and give 
examples where convergence fails. 



1. Introduction 

Many classical geometric inequalities were proved by first establishing the in- 
equality for a simple geometric transformation, such as Steiner symmetrization or 
polarization. Steiner symmetrization is a volume-preserving rearrangement that in- 
troduces a reflection symmetry, and polarization pushes mass across a hyperplane 
towards the origin. (Proper definitions will be given below). To mention just a few 
examples, there are proofs of the isoperimetric inequality and Santalo's inequal- 
ity based on the facts that Steiner symmetrization decreases perimeter E6l ITOl 
and increases the Mahler product ll23ll . Inequalities for capacities and path inte- 
grals follow from the observation that polarization increases convolution function- 
als ll30l [T3l [Tl l2l and related multiple integrals (9l|24l|25]]. This approach reduces 
the geometric inequalities to one-dimensional problems (in the case of Steiner sym- 
metrization) or even to combinatorial identities (in the case of polarization). It can 
also be exploited to characterize equality cases El|3l|9l. A major point is to con- 
struct sequences of the simple rearrangements that produce full rotational symmetry 
in the limit. 

In this paper, we study the convergence of random sequences of polarizations 
to the symmetric decreasing rearrangement. The result of n random polarizations 
of a function / is denoted by S\y 1 ...w n f> where each W{ is a random variable that 
determines a reflection. We assume that the W% are independent, but not necessarily 
identically distributed, and derive conditions under which 

(1.1) Sw 1 ...w n f — > f* i n oo) almost surely . 
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Rearrangements have been studied in many different spaces, with various notions 
of convergence. We work with continuous functions in the topology of uniform con- 
vergence, while most classical results are stated for compact sets with the Hausdorff 
metric. These notions of convergence turn out to be largely equivalent because of 
the mono tonicity properties of rearrangements. 

For sequences of Steiner symmetrizations along uniformly distributed random 
directions, convergence is well known l22l l29l . It has recently been shown that 
certain uniform geometric bounds on the distributions guarantee convergence for 
a broad class of rearrangements that includes polarization, Steiner symmetrization, 
the Schwarz rounding process, and the spherical cap symmetrization ETI . Among 
these rearrangements, polarization plays a special role, because it is elementary to 
define, easy to use, and can approximate the others. Our conditions for convergence 
allow the distribution of the Wi to be far from uniform. We also prove bounds on 
the rate of convergence, and show how convergence can fail. Our results shed new 
light on Steiner symmetrizations. In particular, we obtain bounds on the rate of 
convergence for Steiner symmetrizations of arbitrary compact sets. 



Let X be either the sphere S d , Euclidean space M. d , or the standard hyperbolic 
space M d , equipped with the uniform Riemannian distance d(x, y), the Riemannian 
volume m(A), and a distinguished point o G X, which we call the origin. The ball 
of radius p about a point x G X is denoted by B p (x); if the center is at x = o we 
simply write B p . We denote by dist(x, A) = inf y6j 4 d(x, y) the distance between 
a point and a set, and by 



the Hausdorff distance between two sets. 

If A is a set of finite volume in X, we denote by A* the open ball centered at the 
origin with m(A*) = m(A). We consider nonnegative measurable functions / on 
X that vanish weakly at infinity, in the sense that the level sets {x : f(x) > t} have 
finite volume for all t > 0. (On the sphere, this condition is empty.) The symmetric 
decreasing rearrangement /* is the unique lower semicontinuous function that is 
radially decreasing about o and equimeasurable with /. Its level sets are obtained 
by replacing the level sets of / with centered balls, 



A reflection is an isometry a on X with a 2 = I that exchanges two complemen- 
tary half-spaces, and has the property that d(x, ay) > d(x, y) whenever x and y 
lie in the same half-space. On S d , we have the reflections at great circles, on M. d 
the Euclidean reflections at hyperplanes, and in the Poincare ball model of M d the 
inversions at (d—l) -dimensional spheres that intersect the boundary sphere at right 
angles. For every point x G X there exists a (d—l) -dimensional family of reflections 
that fix x, and for every pair of distinct points x, y there exists a unique reflection 
that maps x to y. 



2. Main results 




{x:f(x)>t} = {x:f(x)>t}*. 
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(a) Polarization 



(b) Steiner symmetrization 



FIGURE 1 . Two simple rearrangements of a set A. Polarization re- 
places a certain piece of A in H~ by its reflection in H + . Perimeter 
is preserved but convexity, smoothness, and non-trivial symmetries 
can be lost. Steiner symmetrization replaces the cross sections of 
A in a given direction by centered line segments. This creates a 
hyperplane of symmetry and decreases perimeter. 

Let a be a reflection on X that does not fix the origin. For x G X, denote by 
x = ax the mirror image of x, and let 

H + = {x : d(x, o) < d(x, o)} , H~ = {x : d(x, o) > d(x, o)} 

be the half-spaces exchanged under a. By construction, o G H + . The polarization 
of a function / with respect to a is defined by 



For obvious reasons, polarization is also called two-point symmetrization. 

We use a fixed normal coordinate system x = (r, u) centered at the origin, where 
r = d(x, o), and denote the parameter space by SI = [0, oo) x S^ -1 . On R d , these 
are just the standard polar coordinates. On X = M d and R d , normal coordinates 
define a diffeomorphism from (0, oo) x S d_1 to X \ {o}, but on X = S d the normal 
coordinate system degenerates at r = ir, where it reaches the south pole. For r > 0, 
let at r v \ be the reflection that maps o to the point with normal coordinates (r, u). 
The reflections {cr( rj ± n ) : r > 0} generate a one-dimensional group of isometries 
of X. As r — > 0, they converge uniformly to a reflection a u := o"(o,«) that fixes 
the origin and exchanges the half-space H+ (that has u as its exterior normal at o) 
with the complementary half-space H~. We do not identify (0, u) with (0, —u) in 
Q, although they label the same reflection on X. If u = (r, u)gH with r > 0, the 
polarization of / with respect to a w is denoted by S u . Given a sequence {w n } in S7, 
we denote the corresponding sequence of polarizations by S Ul Mn = S Ldn o ■ ■ ■ o S UJl . 

Let u be a unit vector in R d , and let / be a nonnegative measurable function that 
vanishes at infinity. The Steiner symmetrization in the direction of u replaces the 
restriction of / to each line {x = £ + tu : t G R}, where with its (one- 

dimensional) symmetric decreasing rearrangement. If the restriction of / to such a 




max{/(a;),/(x)}, if x G H+ , 
min{/(x), f(x)}, if x G H~ . 
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line is not measurable or does not decay at infinity, we set the Steiner symmetriza- 
tion of / equal to zero on this line. We denote the Steiner symmetrization of / by 
S(o,u)f> or simply by S u f. By construction, S u f is symmetric under a u . Note that 
Steiner symmetrization dominates polarization in the sense that 

for every direction u G S^ 1 and all r > (see Fig. 1). 

Polarization and Steiner symmetrization share with the symmetric decreasing 
rearrangement the properties that they are monotone (/ < g implies Sf < Sg), 
equimeasurable (m({Sf > t}) = m({f > t}) for all t > 0), and L p -contractive 
(\\Sf — Sg\\ p < ||/ — g\\ p ) for all p > 1. They also preserve or improve the 
modulus of continuity, which we define here as 

V(p)= sup \f(x)-f(y)\. 

d(x,y)<p 

The corresponding rearrangements of a set A C X are defined by rearranging its 
indicator function 1^. Conversely, the rearranged function can be recovered from 
its level sets with the layer-cake principle, 

roc roo 

/(*)=/ t {f>t} (x)dt, Sf(x)= l s{f>t} (x)dt. 
Jo Jo 

Different from standard conventions, we do not automatically identify functions 
that agree almost everywhere. We have chosen the symmetric decreasing rearrange- 
ment of a function to be lower semicontinuous. In particular, if A is a set of finite 
volume, then A* is an open ball. Polarization and Steiner symmetrization both 
transform open sets into open sets. Polarization also transforms closed sets into 
closed sets, but Steiner symmetrization does not. The literature contains a variant 
of the symmetric decreasing rearrangement that preserves compactness, where A* 
is a closed centered ball if A has positive volume, A* = {o} if A is a non-empty 
set of zero volume, and A* = if A = 0. Steiner symmetrization is again defined 
by symmetrizing along a family of parallel lines. 

A random polarization Sw is given by a Borel probability measure /i on f2 = 
[0, oo) x S d_1 that determines the distribution of the random variable W = (R, U), 
viewed as the identity map on 17. We assume that [i(R = 0) = 0; for X = S d 
we also assume that fi(R > ir) = 0. A random Steiner symmetrization Su is 
given by a Borel probability measure on S^" 1 , or equivalently, by a measure on $7 
with [i(R = 0) = 1. For sequences of random rearrangements {Sw 1 ...w„}n>i with 
each Wi independent and distributed according to a measure ^ on O, we use as the 
probability space the infinite product Q N with the product topology, and with the 
product measure defined by 

n 

P{W 1 €A 1 ,...,W n €A n )=Y[fH(A i ). 

i=l 

In this view, Wi = (R4, Ui) is the i-th coordinate projection on f2 N . 

Let (X) be the space of nonnegative continuous functions with compact sup- 
port in X. (If X = S d , this agrees with the space of all nonnegative continuous 
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functions on § d ). Our first theorem provides a sufficient condition for the almost 
sure convergence of a random sequence of polarizations to the symmetric decreas- 
ing rearrangement. 

Theorem 2.1 (Convergence of random polarizations). Let {Syv 1 ...w n }n>i be a se- 
quence of polarizations on X = S d , Mr, or M d , defined by a sequence of indepen- 
dent random variables {Wi}j>i on Cl. If 

oo 

(2.1) ^ P(d(a Wi ai, bj) < p) = oo 

i=l 

for every radius p > and every pair of bounded sequences {ai}, {bi} in X with 
d(bi, o) > d{fli, o) + 2p, then 

(2.2) P ( lim \\S Wl ... w J - nioo = V/ G C+(X)) = 1 . 



At first sight, the conclusion in Eq. (2.2 1, that the random sequence almost surely 



drives all functions in C+(X) simultaneously to their symmetric decreasing rear- 
rangements, looks stronger than Eq. ( |1.1[ ). As we show in the proof of Theorem 2. 1 



the statements are equivalent, because C^(X) is separable and polarization con- 
tracts uniform distances. Let L+(K) be the space of nonnegative p-integrable func- 
tions. Since polarization also contracts L p -distances and C+(X) is dense in L+(X), 



Eq. (2.2) extends to 



(2.3) P lim \\S Wl ...wJ ~ f\\p = V/ G L+(X) =1 (1 < p < oo 



The assumption in Eq. (2.1 1 implies that infinitely many of the pi assign strictly 
positive measure to every non-empty open set in $7. The measures may concentrate 
or converge weakly to zero as i — > oo, but not too rapidly. This causes typical ran- 
dom sequences to be dense in Q,. We are convinced that almost sure convergence 
holds under much weaker assumptions on the distribution of the random variables 



than Eq. (2.1 1. A related question concerns the conditions for convergence of non- 
random sequences {iOi} in CI. Clearly, convergence can fail if a sequence of polar- 
izations concentrates on a subset of Cl that is too small to generate full rotational 
symmetry. Since the polarization Su u -\ leaves subsets of B r j 2 unchanged, a se- 
quence of reflections must accumulate near r = to ensure convergence. 

It is, however, neither sufficient nor necessary that the sequence be dense in Cl: 
on the one hand, any given sequence of polarizations can appear as a subsequence of 



one for which convergence fails (Proposition 6. 1 d); on the other hand, a sequence of 



polarizations chosen at random from certain small sets can converge to the symmet- 



ric decreasing rearrangement (Theorem 2.2 1. Rather, convergence depends on the 



ergodic properties of the corresponding reflections in the orthogonal group 0(d). 

To state the result, we introduce some more notation. For u G S d_1 , let t u be the 
map from X to itself that fixes the half-space and reflects the complementary 
half-space H~ by a u . We visualize t u as folding each centered sphere down into 
the hemisphere antipodal to u (see Fig. 2a). Given x G X and G C S d_1 , we refer 
to the set 

£>G,x = {r Un ■ ■ ■ t Ui x : n > 0, m, . . . , u n G G} 
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(a) Folding a sphere (b) G = {u, v, w} C S 1 

Figure 2. The map t u folds each centered sphere in M. d into the 
hemisphere opposite to u across the hyperplane it 1 . In d = 2 
dimensions, if u, v, w are not contained in a semicircle and enclose 
angles that are incommensurable with it, does G have dense orbits 
Og,x in S 1 ? Do t u , t v ,t w generate full rotational symmetry? 



as the orbit of x under G. 

Theorem 2.2 (Convergence of i.i.d. polarizations). Let {Sw 1 ...W n }n>i be a random 
sequence of polarizations onlL = S , M. d , or M d , defined by independent random 
variables W% that are identically distributed according to a probability measure 
\x on O with fJ,({R = 0}) = 0. Let support jx be the smallest closed set of full 
p,-measure in Q, and set 

(2.4) G={u£ S^ 1 : (0, u) G support ft J . 

If the orbit Oq,x is dense in S^ 1 for each x € S d_1 , then Syv 1 ...w n converges to 



the symmetric decreasing rearrangement and Eq. (2.2 ) holds. 



In one dimension, polarizations need to accumulate on both sides of the origin to 
produce the desired reflection symmetry. In dimension d > 1, the precise character- 
ization of subsets G C S d_1 that have dense orbits in § d_1 is an open problem. A 
necessary condition is that G be a generating set of directions for the orthogonal 
group, in the sense that the finite products {o~ Un ■ ■ ■ cr Ul : n > 0, ui, . . . , u n G G} 
are dense in 0(d). Also, G cannot be contained in a hemisphere. A sufficient 
condition is that the antipodal pairs {mgG: — u G G} form a generating set of di- 
rections for 0(d), because for every u G S d_1 and every x G X, either o u x = t u x, 
or g u x = T- U x. Must G contain antipodal pairs? Do d + 1 directions suffice? (See 
Fig. 2b.) 

Generating sets of directions for 0(d) are well understood. For instance, if (i) 
the vectors in G span M. d ; (ii) G cannot be partitioned into two non-empty mutually 
orthogonal subsets; and (Hi) at least one pair of vectors in G encloses an angle that 
is not a rational multiple of tt, then G is a generating set of directions. (The third 
condition can be relaxed in dimensions d > 3.) Since d directions {m, . . . u,i} in 
general position are a generating set for O(d), the hypothesis of Theorem 2.2 can 
be satisfied even by measures whose support has only a finitely many accumulation 
points. 



RANDOM POLARIZATIONS 



7 



Theorems 2.1 and |2.2| imply the following statements about Steiner symmetriza- 
tion. 

Corollary 2.3. Let {S'[/ 1 ...[/ n }n>l be a sequence of Steiner symmetrizations on M. d 
along independently distributed random directions {U{\ in S d_1 . 
(a) (Convergence of random Steiner symmetrizations). If 

oo 

(2.5) ^P(d{U i ,v i )<p)=oo 

i=i 

for every radius p > and every sequence {v{} in S d_1 , then 

(2.6) P ( lim WSu^.uJ - r||oo = V/ € C+(R d )) = 1 . 



n— too 



(b) (Convergence of i.i.d. Steiner symmetrizations). The same conclusion holds, 
if instead, the random directions {Ui} are identically distributed according 
to a probability measure p on S rf_1 whose support contains a generating 
set of directions for 0(d). 

3. Related work and outline of the proofs 

The literature contains several different constructions for convergent sequences 
of rearrangements. In their proof of the isoperimetric inequality, Caratheodory and 
Study recursively choose the direction u n of the next Steiner symmetrization such 
that A n = S Un A n _\ is as close to the ball as possible iflOl . Lyusternik proposed 
a sequence that alternates Steiner symmetrization in the d-th coordinate direction 
with Schwarz symmetrization in the complementary coordinate hyperplane and a 
well-chosen rotation [21]. Brascamp, Lieb, and Luttinger alternate Steiner sym- 
metrization in all coordinate directions with a rotation 0. The constructions of 
Lyusternik and Brascamp-Lieb-Luttinger yield universal sequences, which work 
for all nonnegative functions on M d that vanish at infinity. 

A number of authors have addressed the question of what distinguishes con- 
vergent sequences of Steiner symmetrizations, and how to describe their limits. 
Eggleston proved that full rotational symmetry can be achieved by iterating Steiner 
symmetrization in d directions that satisfy a non-degeneracy condition |[T5l p. 98f]. 
Klain recently showed that iterating any finite set of Steiner symmetrizations on 
a convex body results in a limiting body that is symmetric under the subgroup of 
0(d) generated by the corresponding reflections [18]. On the other hand, Steiner 
symmetrizations along a dense set of directions may or may not converge to the 
symmetric decreasing rearrangement, depending on the order in which they are ex- 
ecuted (6|. We note in passing that, although the last three results are stated for 
convex sets, the proofs are readily adapted to functions in C+(IR d ), with the Arzela- 
Ascoli theorem providing the requisite compactness in place of the Blaschke se- 
lection theorem. By choosing the measure in Corollary |2.3p to be supported on a 
finite generating set of directions, we obtain an analogue of Eggleston's theorem 
for random sequences. 

Finding even one convergent sequence of polarizations is more difficult, because 
it is not enough to iterate a finite collection of polarizations. Baernstein- Taylor, 
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Benyamini, and Brock-Solynin argue by compactness that the set of functions that 
can be reached by some finite number of polarizations from a function / contains /* 
in its closure (21 SI [8j . The greedy strategy of Caratheodory and Study also works 
for the case of polarizations. Both constructions result in sequences that depend on 
the initial function. A universal sequence was produced by van Schaftingen [28]. 

In these papers, considerable effort goes into the construction of convergent (or 
non-convergent) sequences that are rather special. The question whether a ran- 
domly chosen sequence converges with probability one was first raised by Mani- 
Levitska Il22l . He conjectured that for compact subsets of R d , a sequence of Steiner 
symmetrizations in directions chosen uniformly at random should converge in Haus- 
dorff distance to the ball of the same volume, and verified this for convex sets. 

The Mani-Levitska conjecture was settled by van Schaftingen for a larger class of 
rearrangements that have the same monotonicity, volume-preserving, and smooth- 
ing properties as the symmetric decreasing rearrangement [27]. We paraphrase his 
results for the case of polarization. Van Schaftingen proves the convergence state- 



ment in Eq. (2.2 1 under the assumption that the random variables Wi are indepen- 



dent and their distribution satisfy the uniform bound 
(3.1) lim mi P(d(a Wn a,b) < p) > 

n— >oo 

for every a,b E X and every p > 0. In the proof, he first constructs a universal 
sequence, that is, a single non-random sequence {wj}j>i in f2 such that the sym- 



metrizations S U j 1 ,,, UJn f converge uniformly to /* for every / G C+(X). Eq. (3.1 1 
implies that typical random sequences closely follow the universal sequence for 
arbitrarily long finite segments, i.e., for every p > and every integer N > 1, 

P(3k : d(W k+n , un) < p for n = 1, . . . , N) = 1 . 



After taking a countable intersection over Pn = an d N G N, Eq. (2.2 1 follows 
with a continuity argument. 

The condition in Eq. ( |3.1[ ) is stronger than the corresponding assumption of The- 
orem 2.1 To see this, let {aj}, {bi} be a pair of bounded sequences in X, and 
choose a pair of subsequences {a^}, {h k } that converge to limits a and b. For k 
sufficiently large, 

d(a u a, b) < | =^ d(a w a ik ,b ik ) < p. 



If Eq. (3.1 1 holds, then P{d{aw l a>i k , bi k ) < p) does not converge to zero, and the 



series in Eq. (2.1 1 diverges. We later show examples that satisfy Eq. (2.1 1 but not 



Eq. (3.1) 



Independently, Volcic has given a direct geometric proof for the convergence of 
Steiner symmetrizations along uniformly distributed random directions [29]. His 
proof is phrased as a Borel-Cantelli estimate, which suggests that pairwise inde- 
pendence of the Wi might suffice for convergence (see [ 14, p. 50-51]). Upon closer 
inspection, there is a conditioning argument where the independence of the Wi 
comes into play. It is an open question if convergence can be proved under weaker 
independence assumptions. 
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We are not aware of any prior work on rates of convergence for polarizations. 
There are, however, some very nice results regarding rates of convergence for 
Steiner symmetrizations of convex bodies. Klartag proved that for every con- 
vex body K C M. d and every < e < 1/2, there exists a sequence of n = 
\cd A log 2 (l/e)] Steiner symmetrizations u\, . . . , u n such that 

(3.2) d H (dS Ul ... Un K, 8K*) < e ■ radius (K*) , 

in other words, (1 — e)K* C S ul ___ Un K C (1 + e)K* . This means that the distance 
from a ball decays faster than every polynomial |[T9l Theorem 1.5]. Remarkably, c 
is a numerical constant that depends neither on K nor on the dimension. The control 
over the dimension builds on the earlier result of Klartag and Milman l20l that 3d 
Steiner symmetrizations suffice to reduce the ratio between outradius and inradius 
of a convex set to a numerical constant. Around the same time, Bianchi and Gronchi 
established bounds on the rate of convergence in the other direction 0. For each 
n and every dimension d, they construct centrally symmetric convex bodies in M. d 
whose Hausdorff distance from a ball cannot be decreased by any sequence of n 
successive Steiner symmetrizations. Their construction yields a lower bound on 
the distance from a ball for arbitrary infinite sequences of Steiner symmetrizations. 
Klartag 's results have recently been extended to random symmetrizations of convex 
bodies [12]. It is not known whether convergence is in fact exponential, and whether 
Klartag's convergence estimates can be generalized to non-convex sets. 

The proofs of Mani-Levitska, van Schaftingen, and Volcic involve a detailed 
analysis of typical sample paths. Since they rely on compactness and density argu- 
ments, they do not yield bounds on the rate of convergence. In contrast, Bianchi- 
Gronchi and Klartag use probabilistic methods to find non-random sequences with 
desired properties. The construction of Bianchi and Gronchi takes advantage of 
ergodic properties of reflections. Klartag views the rearrangement composed of a 
random rotation followed by Steiner symmetrizations in each of the d coordinate 
directions as one step of a Markov chain on convex bodies. He replaces the Steiner 
symmetrizations by Minkowski symmetrizations to obtain a simpler Markov chain, 
which acts on the support function of a convex body as a random orthogonal pro- 
jection in L2. Since this simpler process is a strict contraction on the spherical 
harmonics of each positive order, the support function converges exponentially (in 



expected L2-distance) to a constant. He finally obtains Eq. (3.2i from a subtle geo- 
metric comparison argument. 

We combine an analytical approach similar to Klartag's with the geometric tech- 
niques used by Volcic. The sequence {Sw 1 ...w„f}n>i defines a Markov chain on 
the space C+ (X). We use that the functional 

(3.3) Af)= [ f(x)d(x,o)dm(x) 

Jx 

decreases under each polarization, and make Volcic 's conditioning argument ex- 
plicit by appealing to the Markov property. Here, dm(x) denotes integration with 
respect to the standard Riemannian volume on X = S d , M d , or M. d . For the proof of 



Theorem 2.1 we quantify the expected value of the drop T{f) — Z(Swf) in terms 
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°f 1 1 / — /* 1 1 oo and the modulus of continuity of /. Since the expected drop goes to 
zero, Syvi,..w„f converges uniformly to /*. 



For the case of i.i.d. polarizations considered in Theorem 2.2 the challenge is 
that their distribution may be supported on a small set. Here, we resort to a com- 
pactness argument. By monotonicity, l(Sw 1 ...w„f) approaches a limiting value. 
Under the assumptions of the theorem, the drop of 1 has strictly positive expecta- 



tion unless f = f* (Lemma 4.3 ). This forces the limits of convergent subsequences 
to be invariant under a family of transformations (the folding maps t u parametrized 
by Eq. ( |2.4| )), which play the role of competing symmetries ifTTIl : the only functions 
that are invariant under the entire family are constant on each centered sphere. 

Our estimates for the expected drop of 1 imply bounds on the rate of convergence 
that depend on the modulus of continuity of / and the distribution of the Wi. In the 
case where the Wi are uniformly distributed on a suitable subset of Q, we show that 
there exists a numerical constant c such that 

E(\\S Wl ... w J - tlU <cLLip(/)n-^i 

for every Lipschitz continuous nonnegative function / on M. d with support in Bl 



(Proposition 5.2 1. On the other hand, there exist Lipschitz continuous functions / 



with support in Bl such that 

e(\\s Wi ... w j - nioo) > c\\f - nioof 



where c > and q G (0, 1) are numerical constants (Proposition 6.1 1). 
For Steiner symmetrization, we use that 

(3.4) 1(f) < l(S u f) < l(S {r , ±u) f) < 1(f) 

for every u G S ' -1 and all r > to bound the expected value of the drop 1(f) — 
l(Sjjf) under a random Steiner symmetrization from below by the corresponding 
estimate for a random polarization (Corollary \23\ . By the same token, the power- 
law bounds on the rate of convergence extend to Steiner symmetrizations along 



uniformly distributed directions (Corollary 5.4 1. Since we ignore that Steiner sym- 
metrization reduces perimeter, these bounds cannot be sharp, but to our knowledge 
they are the only available bounds that do not require convexity. It is an open ques- 
tion whether the sequence converges exponentially, and how the rate of convergence 
depends on the dimension. Is it more effective to alternate Steiner symmetrizations 
along the coordinate directions with a random rotation, as in lfl9l ? Does it help to 
adapt the sequence to the function? Do polarizations converge more slowly, perhaps 
following a power law? 



4. Almost sure convergence 

We start by preparing some tools for the proof of the main results. Let 1 be the 
functional defined in Eq. ( |3.3| ). The first lemma is a well-known identity, which is 
related to the Hardy-Littlewood inequality J fg < f f*g*. We reproduce its proof 
here for the convenience of the reader. 
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Lemma 4.1 (Polarization identity). Let f be a nonnegative measurable function 
with 1( f) < oo, and let S w be a polarization. Then 



X(f)-X(S UJ f) = / [f(a u x)-f(x)] + [d(o u x,o)-d(x,o)] + dm(x). 

JX 

In particular, X(f) > X(S u] f) unless S w f = f almost everywhere. 

Proof. We rewrite the functional as an integral over the positive half-space H + 
associated with uj, 

X(f)-X(S u f) = / {(f(x)-S w f(x))d(x,o) + (f(x)-S UJ f(x))d(x,o)}dm(x), 

JH+ 

where x = cr u (x). If f(x) > f(x) for some x G H + , then the values of S^f at 
x and x agree with the corresponding values of /, and the integrand vanishes at x. 
If, on the other hand f(x) < f(x), then the values are swapped for S u f, and the 
integrand becomes (f(x)—f(x))(d(x, o)—d(x, o)), where both factors are negative. 
We switch the signs, collect terms, and integrate to obtain the claim. □ 



The next lemma is the key ingredient in the proof of Theorem 2. 1 



Lemma 4.2 (Expected drop of X). Let f be a nonnegative continuous function with 
compact support in Br, C X/or some L > and modulus of continuity ij. Set 
e = ||/ — /*||oo. let p > be so small that rj(p) < |, and let W = (R, U) be a 
random variable on 17, as described above. Then 

(4.1) E(X(f) - X(S w f)) > C £ ■ inf P(d(a w x, b) < p) , 

x,b 

where C e = epm(B p )/2, and the infimum extends over x, b with d(x, o) + 2p < 
d(b, o) < L — p. Furthermore, on X = IR d , 

(4.2) E(X(f)-X(Suf))>C' e - inf P(2Lsmd(U,v) < p) , 

vest- 1 

where C' £ = epm(B p )/8. 

Proof. We first construct a pair of points a, b € X such that 

d(b,o)>d(a,o)+Ap, f(b)>f(a) + £ - 

(see Fig. 3a). By assumption, there exists a point xq with |/(xo) — f*(x^)\ = e. Set 
t = ^(f(x ) + f*(xo)), let A = {x : f(x) > t}, and let A* be the corresponding 
level set of /*. If f(xo) < f*(xo), we set a = xq. By construction, a G A* \ A. 
Since this set is open and non-empty, it has positive volume, and therefore A \ A*, 
having the same volume, is non-empty. Let b G A \ A*. Then f*(a) — f*(b) > 
f*(a) - t = s/2. Similarly, if f(x ) > f*(x ), we set b = x G A \ A*, find 
a G A*\A, and note that f*(a) - f*(b) > t- f*(b) = e/2. Since the modulus of 
continuity of / is valid also for /* and r](Ap) < e/2, we have d(b, o) — d(a, o) > 4p. 
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(a) Choice of a, b in Lemma [42] (b) The estimate in Eq. \53) 

Figure 3. Polarization swaps the part of A \ a u A that lies in in 
H~ with its mirror image in H + . If A is a level set of / and <j w is 
the reflection that maps a to b, then T(S w f) < T(f). The volume 
of A A A* decreases by the combined volume of the light and dark 
shaded subsets. 



By Lemma [47T| and Fubini's theorem, a random polarization S\y satisfies 

E(l(f)-l(S w f)) 

= E ( / [f(vwx) ~ f(x)] + [d(a w x, o) - d(x, o)] + dm{x) 



> / P(d(awx, b) < p) dm(x) , 

2 JB p (a) 

because the choice of a and b ensures that f{ay/x) — f{x) > e/4 and d{ay/x, o) — 
d{x, 6) > 2p for all x G B p (a) with d(awx, b) < p. Eq. ( |4.1[ ) follows by minimiz- 
ing over x, a and b and evaluating the integral. 

For a random Steiner symmetrization Sjj, we use Eq. (3.4) to obtain 

E(l(f)-l(Suf)) 

> E ( sup / [f{&(r,±U) x ) ~ f( x )] + [ W(r,±U) x \ — M ] + dm(x) ) 

\r>0,± JX J 

> ^(li nf l wa - 6 K,/^ (a) f^)j 

> e -^-m{B p {a)) ■ P(2L sin d(U,v) < p) , 
8 

where v is the unit vector in the direction of b — a, and d(U, v) is the enclosed 
angle. In the second line, the infimum runs over (r > 0, ±), and we have used that 
f(ax) — f(x) > e/8 and \ax\ — \a\ > p whenever \x — a\ < p and \aa — b\ < p. 
In the last line, we have estimated the infimum by 

inf \(T( r ±TT\a — b\ = infla + tf/ — b\ < (lal + \b\) s'md(U, v) , 

r>0,± v ' ; teffi 

and applied Fubini's theorem. □ 

Proof of Theorem 2.1 Given / £ C^iW 1 ), let F n = Sw±...w n f be the result of n 
random polarizations of /. Since F n = Sw n Fn-i, the sequence X{F n ) decreases 



RANDOM POLARIZATIONS 



13 



monotonically and satisfies 

1(f) > X(i^-i) > T{F n ) > l(f*) . 

By writing the difference as a telescoping sum and taking expectations, this implies 
that 

1(f)- 1(f) 

> E^fyL(F n ^)-l(S 

oo 

= E(E(X(F n ^) - l(S Wn F n _ l ) \Wi... W n _ x )) 

n=l 

(4.3) > C e -f2UnfP(d(a Wn x,b)> p)-P(\\F n _ 1 -f*\\ 00 >e)\, 

n=l J 

where the infimum extends over all x, b with d(x, o) + 2p < d(b, o) < L — p, and 
C £ , p, and L are positive constants that depend on /. We have used the Markov 
property in the second step, and applied Eq. ( |4.1| ) of Lemma 4.2 in the third. In 
particular, the sum in Eq. (|4.3| > converges. Since the first factors in the product are 
not summable by Eq. \2.l\ , the second factors must have zero as an accumulation 
point. By monotonicity, they converge to zero. Since e > was arbitrary, we 
conclude that 

pflim \\F n -f*\\ oo = 0) =1. 

\n— >oo / 

This establishes Eq. ( |1.1| >. 

To complete the proof, we choose a countable dense subset Q C C+. Let {wj}j>i 
be a sequence in Q. Since polarizations and the symmetric decreasing rearrange- 
ment contract uniform distances, we have for every pair of functions f,g£ and 
every n > 1, 

\\Su)i...u> n f — f 1 1 do 5: 2| |/ — g\ |oo + I |5'aji...aj ri 5 ~~ 9 Woo ■ 

We take n — > oo and minimize over g G to obtain, by the density of Q, 

lim \\S Wl .. Mn f ~ f*\\oo < inf |2||/ -5IU + Hm ||5 Wl ... u ,„5 - /Hoc \ 

n— »oo g£G k n— >oo J 

< sup lim I \S Ul ... Wn g - g*\\oo ■ 
gag n ~>°° 

Since Q is countable, it follows that 



P 3/€C+: lim ||5 l y 1 ... iy „/-r|| 00 >0 

1 n— >oo 

< P (35 G ^ : lim ||5 , H / 1 ...iy„5' - 5*Hoo > 



n— >oo 

< VP( lim ||5 Wl ... Wn 5-^||oo>0 

' \ n— >oo 

= 0, 



proving Eq. (2.2). □ 
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For the proof of Theorem 2.2 we need one more lemma. 



Lemma 4.3 (Identification of symmetric decreasing functions). Let f G C+(X). 

(a) (by polarization). Let W be a random variable on Q whose distribution 
satisfies [i(R = 0) = 0. If the orbit of each x G under G = {u G 
S rf_1 : (0, u) G support fi} is dense in S d_1 , then 

E(X(S w f))=l(f) / = /*. 

(b) (by Steiner symmetrization). Let U be a random variable on S^" 1 , and let 
fi be its probability distribution. If the support of \x contains a generating 
set of directions for 0(d), then 

E(l(Suf))=l(f) / = /*. 



Proof. For part (a), suppose that E(X(Sw /)) = %(f)- It follows from Lemma 4.1 
thatX(5 a ,/) = and hence S^/ = /, for /i-a.e. w. This means that /(r^x) > 
/(x) for /i-a.e. a; and all x G X. 

Let u G G. By assumption, /i assigns strictly positive measure to each neighbor- 
hood of (0, u) in Q. Since = 0}) = 0, we can find a sequence = (r,, Uj) 
with ri > that converges to (0, u) such that f(r Wi x) > f(x) for each i and all 
x G X. By continuity, f(r u x) > /(x) for all x G X, which means that the value 
of / increases monotonically along orbits r Un . . . t Ui x of G. Since 0g,ie is dense in 
the sphere of radius \x\ and / is uniformly continuous, / must be radial. 

To see that / is symmetric decreasing, we write it as f(x) = (fi(d(x,o)) for 
some continuous function <f>. Consider first the cases X = R d and M d . Given 
t > 0, choose oj = (r,u) with < r < 2t such that /(r^x) > f(x) for all 
x G X, and let a be the point with normal coordinates (r, u). The reflection 
maps the centered sphere of radius t to the sphere of the same radius centered at 
a. Since this sphere contains the points with normal coordinates (r ± t, u), by 
the intermediate value theorem it contains for each s G (t, t + r] a point x with 
d(x, o) = s. Since d(a ul x, o) = t < s, the point x lies in the negative half-space 
H~ . It follows that <j)(s) = f(x) < /(r w (x)) = <ft(t). Iterating the argument, we 
conclude that <p(s) < <p(t) for all s > t > 0. Since <j) is continuous, we can take 
t — > and conclude that 4> is non-increasing on [0, oo). In the case X = E> d , the 
above argument remains valid for t G (0, it), provided that r < min{2£, tt — t}, and 
we obtain that <fi is nonincreasing on [0, tt]. This proves that / = /*. 

For part (b), suppose that E(I(Suf)) = 1(f)- We augment the random di- 
rection U to a random variable W = (R, ±C7) on Q, where R is exponentially dis- 
tributed on M + , the positive and negative signs are equally likely, and the three com- 
ponents are independent. Then E(I(S^±jj^f) = 1(f) by Eq. ( |3.4[ ). The probabil- 
ity distribution of W is given by the measure dv(r, u) = \e~ r dr(d^(u) + dfi(-u)) 
on Q. By construction, v({R = 0}) = 0. Since the support of // contains a gener- 
ating set of directions for O(d), the orbit of any vector x G S^ 1 under 



G = {(0, u) G support v} = {±u : u G support /i} 
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is dense in S d 1 . Therefore, v satisfies the assumptions of part (a), and we conclude 
that / = /*. Finally, the converse implications hold because S w f* = f* for all 

co e o. □ 



Proof of Theorem 2.2 Let Wbea random variable on Q, that is distributed accord- 



ing to the measure \i from the statement of the theorem. Lemma 4.3 guarantees 
that E(l(Swf)) < Z(f) unless / = /*. Let Cl jV be the set of all nonnegative 
continuous functions supported in the ball of radius L whose modulus of continuity 
is bounded by rj. Since X is continuous in the uniform topology and Cl j7) is compact 
by the the Arzela-Ascoli theorem, 

h{e) := M{E(I(J) - X(S w f)) : / G C L , n , ||/ - f* IU > e} > 

for each e > 0. 

Given / G C^, let rj be its modulus of continuity, and assume that / is supported 
in Bl. Denote by F n = Swi...w„f the result of n random polarizations of /. 
Since polarization preserves the modulus of continuity and the ball Bl, we have 



F n £ Cl,ti- We argue as in the proof of Theorem 2. 1 that 



i(f) -ad > 



oo 

E 

n=l 



E{E(Z(F n ^) - X{S Wn F n ^) \W 1 ... W n _i)) 



(4.4) 



> h(e)-J2 p (\\Fn-i-r\\oo>e) 



n=l 



In the second line, we have used the Markov property and the definition of h(e). 
Since h(e) > 0, the sequence F n converges almost surely uniformly to /*, and 
Eq. ([Z2]) follows. □ 



Proof of Corollary 2.3 We proceed as in the proofs of Theorems 2. 1 and 2.2 with 
Eq. (4.2 ) and Lemma 4.3 3 in place of Eq. (|4.1[) and Lemma 4.3 1. □ 



5. Examples in Mr 

The following lemma allows to transform integrals over ft into integrals over Mr. 
Geometrically, we map co = (r, u) to the image of a point a under the reflection 
g w . Since for every point z ^ a there exists a unique reflection that maps a to 
z, this defines a diffeomorphism from \ {r = 0} to M. d \ {a}. For a = o, the 
diffeomorphism agrees with the polar coordinate map. 

Lemma 5.1 (Change of variables). Let a G W 1 . Then 



g(a LJ a) dco 



l{z)\z - ar (d - 1} dz 



for every measurable function g on M. d such that the integral on the left hand side 
converges. Here dco = drdm(u) denotes the uniform measure on £1. 



Proof. Set z = a^a. If we write co = (r, u) and express z — am polar coordinates 
(s, v), then v = u because the lines x = £ + tu are invariant under a w . If r moves 
by a certain distance, then z moves by that distance in either the direction of u or in 
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Figure 4. The change of variables in Lemma 5.1 In polar co- 
ordinates centered at a and o, the volume element transforms as 

dsdv = drdu. 



the opposite direction (see Fig. 4). In polar coordinates, the metric on f2 transforms 
as (ds) 2 + (dv ) 2 = (dr) 2 + (du) 2 . The claim follows by returning to Cartesian 
coordinates for z. □ 

We use this formula to construct examples of measures that satisfy the hypoth- 



esis of Theorem 2.1 but not Eq. (3.1 ). Consider the Gaussian probability measure 
on U. d whose density is the centered heat kernel at time t. By changing to polar 
coordinates, we obtain a probability measure on Q, given by 



p{A) 



1 



(27Tt) 



2t r° 



~ l du , 



where to = (r, u). Fix p, L > 0, let a, b be a pair of points in M. d with \a\ + 2p < 
\b\ < L — p, and consider the event {uj : d{a ul a,b) < p}. If z = a w a G B p (b), 
we use that \z\ — \a\ < r < \z\ + |o| to see that r G [2p, 2L]. It follows that there 
exists a constant C (depending on p, L, and the dimension but not on t) such that 

d 



the density of p in this region is bounded from below by Ct" 2e 
variables with Lemma|5.1l we estimate 



2L Z 
t 



Changing 



p({uj : d(a u a, b) < p}) > Ct-*e~— / \z - a\~ {d ~ l) dz > C't'^e 

JB p (b) 

Therefore P, the product of a sequence of such measures, satisfies 

i=l i=l 

l d with |a;| + 2p < |6i| < L - 



for any pair of sequences {aj}, {bi} in M d with |aj| + 2 p < \bi\ < L — p . For 
U = (log log f) -1 the sum diverges as required by Eq. (2.1), but Eq. (3.1 1 fails 
because the m easu res converge weakly to zero. For the sequence U = i 2 ^ d , Eq. ( 2. 1 1 
holds but Eq. (3.1 1 fails because the measures concentrate on {0} x S rf_1 . 



To give a similar example for Steiner symmetrizations, consider the probability 
measures on S rf_1 defined by the Poisson kernel 



p(A) 



1 



1 - 

A V 



dm(u) , 
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where z is a point in the ball, and dm denotes integration with respect to the standard 
Riemannian volume in S d_1 . Since the density of p, with respect to the uniform 
probability measure on S d_1 is bounded from below by 2~( d ~ 1 ^ (1 — \z\ ), the product 
of such measures satisfies 

oo / R \ oo 

Ew^xp)>2-(^)^ t x;(i-n) 

i=l ^ ; i=l 

for every sequence in S d_1 . If 2j = (1 — l/i)u for some u G S d_1 , then 



the sum diverges and Eq. \2.5) holds, but condition ( |3.1| > fails because the measures 
converge weakly to the point mass at u. 

In principle, the proofs of Theorems 2.1 and |2.2| imply weak-type bounds on the 
rate of convergence. Eq. ( |4.3| > yields that 

P(\\ TP — f* II > e-^ < -E(f) ~ -^(/ ) 

Ul " ' llc °- £) - C £ -J^ =1 inf x , b P(d(a Wi x,b)>p)> 

where C £ and p depend on the modulus of continuity of /. Similarly, since \ \F n — 
f* | |oo is non-increasing, Eq. (4.4 1 yields that 

P(||iW*l|oc > e) < m ~ X . ir) n- 1 , 

h{£) 

where h(e) depends on the distribution of the random polarizations and the modulus 
of continuity of /. For i.i.d. uniform sequences of rearrangements, we have a more 
explicit bound: 

Proposition 5.2 (Rate of convergence for random polarizations). If {Wi}i>\ is a 
sequence of independent uniformly distributed random variables on (0, 2L) x S d_1 , 
then 

(5.1) ^(H^...^/ - rill) < 2^772(^2^) H/Hoon- 1 

for every nonnegative bounded measurable function on M. d with support in Bl. If 
additionally, f is Holder continuous with modulus of continuity rj(5) < c5 a for 
some a G (0, 1] and c > 0, then 

(5.2) E(\\S Wl ...wJ ~ f* Hoc) < WcL a n-^ . 

In the proof of the proposition, we will use the following lemma. 

Lemma 5.3 (Expected drop in symmetric difference). If W is a uniformly dis- 
tributed random variable on (0, 2L) x § d ~\ then 

m(A A A*) - E(m(S w A A A*)) > — \- . (m(A A A*)) 2 

2dm(B2L) 

for every measurable set A C Bl in M d . 

Proof. Fix uj € O, and let H + and H~ be the half-spaces associated with oj. By 
construction, polarization swaps the portion of A \ a w A that lies in H~ with its 
mirror image in H + (see Fig. la). Of these sets, precisely the portion of A \ A* 
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whose reflection lies in A* \ A contributes to the symmetric difference AAA*, 
twice, see Fig. 3b. But this just means that 

(5.3) m(A A A*) - m{S UJ A A A*) = 2m{{x G A* \ A : a u (x) G A \ A*}) . 

We compute the expectation, using Fubini's theorem and the change of variables 
from Lemma IBTTI The result is 

m{A A A*) - E(m(S w A A A*)) 

2 / P{a w {x) eA\A*)dx 
Ja*\a 

\x - zr( d-1 ) dzdx 



1 



1 



A*\A JA\A* 
2 



> —(m(A A A*)) , 

where C = 2dm(B2L)- In the last step, we have used that the distance between 
x and z is at most 2L, and that A and A* have the same volume. Note that the 
Riemannian volume of the unit sphere in M. d is related to the Lebesgue measure of 
the unit ball by m(S d " 1 ) = dm{B 1 ). □ 

Proof of Proposition \5.2\ Consider first the case where f = 1a for some measur- 



able set A C Bl, and let A n = Sw n ...W^- By Lemma 5.3 the Markov property, 
and Jensen's inequality, 

E(rn(A n -i A A*)) - E(m(A n A A*)) 

= E(m(A n ^ A A*) - E(S Wn A n ^ A A*) | W\, W n -{j) 



> ^E((m(A n ^AA*)) 2 ) 

> ^{E{m(A n ^AA*)))\ 

where C = 2dm{B 2 L)- This shows that z n = C^ 1 E(m(A n A A*)) satisfies the 
recursion relation z n < z n -\(\ — z n -\). Since z^ 1 > z~\ + 1 and Zq 1 > d2 d+l , 
it follows that 

(5.4) E(m{A n AA*))<C{n + d2 d+1 )- 1 . 

If / is a nonnegative bounded measurable function on Bl, we use the layer-cake 
principle to write 

ll/-rili= / m({/> S }A{/* >s})ds, 
Jo 

and likewise for F n = S\y n ...w n f- Since / is bounded, the integrand vanishes for 
s > 1 1 / 1 1 oo , and we obtain from Eq. ( |5.4| ) that 

^(ll^-ni!) < C7||/|| 00 (r 1 + d2 d + 1 )- 1 , 

proving the first claim. 

If / is Holder continuous, then F n and and /* are Holder continuous with the 
same modulus of continuity. Let e = \\F n — /*||oo. an d set p = ?7 _1 (e/4). Since 
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F n differs from /* by at least s/2 on some ball of radius p, we have | \F n — /* 1 1^ > 



em(B p )/2. We obtain from Eq. (5.1 1 that 



d 



2(4c); 
m(Bi) 



^((ii^-riioo) 1 ^) < ^^^(ii^-riii) 



< d2 d (4cL a ) 1+ an~ 1 . 
Applying Jensen's inequality once more, we arrive at 

E(\\F n - nU) < (d2 d )^4cL a n~^ . 
The leading constant is maximized at a = 1 and d = 6, and Eq. (|5.2[) follows. □ 



By Eq. ( |3.4| ), Proposition 5.2 extends to Steiner symmetrization along directions 
chosen independently and uniformly at random on S d_1 . 

Corollary 5.4 (Rate of convergence for random Steiner symmetrizations). If{Ui}i>\ 
is a sequence of independent uniformly distributed random variables on §> d ~ l , then 

EiWSu^.uJ - nil) < 2dm(B 2L ) H/lUn- 1 

for every nonnegative bounded measurable function with support in B^. If f is 
Holder continuous with modulus of continuity r/(5) < c5 a for some a E (0, 1] and 
c > 0, then 

E(\\S Wl ... w J - riloo) < !0cL a n-^ . 

6. Negative results 

In this section, we give some bounds on the rate of convergence that complement 
Proposition 5.2 and Corollary |5.4| and construct examples where convergence fails. 



For polarization, we use the function 

(6.1) f(x) = [l-d(x,a)] + , 

which is supported on B\ (a) and Lipschitz continuous with constant one. Its sym- 
metric decreasing rearrangement is given by /* (x) = [1 — \x\\ + , and 1 1 / — /* | |oo = 
mm{d(a, o), 1}. Its polarization at ui G Q is given by 

Scjf(x) = [1 - d(x,T UJ a)} + , 

where r w is the folding map that fixes the positive half-space and reflects H~ 
across the separating hyperplane. 



Proposition 6.1. Let f be given by Eq. (6.1 1. 



(a) (Convergence of random polarizations is not faster than exponential). 

Ifd(a, o) < 1, then 

E(\\S Wl ... w J - fWvo) >\\f - 1*11^2-" 

for every sequence {Wi}i>± of independent random variables on f2 such 
that the distribution of each Wi = (Ri, Ui) is symmetric under Ui \— > —Ui. 

(b) (Non-convergence). If a ^ o, then there exists a dense sequence {ui\i>\ in 
0, such that 5 , Wl ... a;n / has no limit in C^(X). 
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Proof. A single random polarization results in S\yf{x) = [1 — d(x,Tyva)] + . Since 
T\ya = a whenever a € H^, its expected distance from the origin satisfies 
E(d(Twa,o)) > d(a,o)/2. By iteration, we have Sw 1 ...w n f(x) = [1 — d(x, a n )] + , 
where a n = Tyv n a n -i and ao = a. By the Markov property, E(d(a n ,o)) > 
d(a, o)2~ n , and the first claim follows. 

For the second claim, we realize an arbitrary sequence as a subsequence of 
one for which convergence fails. Given {wj}j>i, fix < e < d{a,o) and de- 
fine {Cui}i>i as follows. On the odd integers set &2n-l = (min{2 _n e, r n }, ±u n ), 
where (r n ,u n ) = u> n , and the sign is chosen in such a way that Sa^.^n-if is 
unchanged by S Un . On the even integers, set U2n = w n . If {wj} is dense, then {Qi} 
is dense as well. 

Set /„ = Sui 1 ...u> n f = [1 — d(x,a n )} + . Suppose that /„ converges to some 
limit g. Then g{x) = [1 — d(x, b)] + for some b. Let oj = (r, u) G f2 with r > 0. By 
density, we can find a subsequence {oJ nk } that converges to w. Since both a nfe _i 
and a nfc = r,2, nfc a rife _i converge to 6, we must have tJj = b. Since u was arbitrary, 



it follows that b = o. On the other hand, d(b, o) > d(a, o) — £^2 n > 0, a 
contradiction. □ 

The corresponding bounds for Steiner symmetrizations on R d are slightly more 
involved. As an example, we use the function 

(6.2) f(x) = [l-(x,Mx)} + , 

where M is a positive definite symmetric d X d matrix. The symmetric decreasing 
rearrangement of / is f*(x) = [1 — A*|x| 2 ] + , where A* is the geometric mean of 
the eigenvalues of M. The distance from / to /* satisfies 

/rL ox A max A m i n n „ j,^ 1 1 Amax Amin 

(O- 3 ) 7T\ <\\J ~ J Woo < 



We will prove the following statements. 



Proposition 6.2. Le£ / be given by Eq. ( |6.2| ) w/£/z some positive definite symmetric 
matrix M. 

(a) (Convergence of random Steiner symmetrizations is not faster than expo- 
nential). If {Ui} is a sequence ofi.i.d. uniform random variables on S rf_1 
and the extremal eigenvalues of M satisfy A max < 2A m j n , then 

^(ii^,.. C / n /-riioc)> Jn/-riioo3- n . 

(b) (Non-convergence). If M is not a multiple of the identity, then there exists a 
dense sequence {uj}j>i in S ^ 1 such that S Ul _ Un f has no limit in C^(M. d ). 

We first show that Steiner symmetrization preserves the form of /. 



Lemma 6.3 (Steiner symmetrization of ellipsoids). If f is given by Eq. (6.2 ), then 
S u f has the same form with a positive definite symmetric matrix M' determined by 

(6.4) (x, M'x) = (x, Mx) - ^' *f?\ + (x, u) 2 (u, Mu) . 

(u, Mu) 

In particular, u is an eigenvector of M' with eigenvalue (u, Mu). 
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Figure 5. Steiner symmetrization of an ellipse. The diameter 
shrinks at most by a factor cos a. 

Proof. Consider a line x = £ + tu with ( 1 «. The restriction of / to this line, 
given by 

[I- (^M£) -2t(£,Mu) -t 2 (u,Mu}}+ , 

is symmetric decreasing about to = — /^'^*a ■ By definition, the restriction of S u f 
to the line is the symmetrized function 

t ^ [l-(Z,M0 + (t 2 -t 2 )(u,Mu)] + , 



as required by Eq. (6.4). Since u and the line through u are invariant subspaces 
for M', we conclude that u is an eigenvector. The corresponding eigenvalue is 

A = (u,M'u) = (u,Mu). □ 



Remark. An amusing consequence of Lemma 6.3 is that (d— 1) Steiner symmetriza- 
tions suffice to transform an ellipsoid into a ball 11201 . To see this let A be an ellip- 
soid of the same volume as the unit ball. Then A = {(x, Mx) < 1} = {/ > 0}, 
where M is a positive definite symmetric matrix of determinant one, and / is 



given by Eq. (6.2). Set Mq = M, and choose u\ such that (ui,Mu\} = 1. By 
Lemma 6.3 S Ul A = {(x, Mix) < 1}, where Mi is a positive definite symmetric 
matrix that has u% as an eigenvector with eigenvalue 1. Iteratively choosing m or- 
thogonal to «i, ... , Ui-\ such that (ui, Mj_iUj) = 1, we arrive at Af^-i = I, and 
conclude that S Ud _ 1 . . . S Ul A = A*. □ 



To prove Proposition 6.2 we need to analyze how the extremal eigenvalues of M 
change under Steiner symmetrization of /. Clearly, their difference decreases, be- 
cause the inradius of the corresponding ellipsoid grows under Steiner symmetriza- 
tion, and its outradius shrinks. The following lemma shows that the change in the 
extremal eigenvalues is small, if the direction of the Steiner symmetrization is ei- 
ther almost parallel or almost orthogonal to the maximizing eigenvector w max (see 
Fig. 5). 

Lemma 6.4 (Eigenvalue estimate). Given a symmetric positive definite matrix M 
with extremal eigenvalues A max . A m i n and corresponding normalized eigenvectors 



"Vmax. "Vmm. Define M' by Eq. (6.4). The extremal eigenvalues A' max , A' min of M' 
satisfy 

(6.5) A' max - A' min > (l — Ctj)((u, Vmax)) - 2^?((«, fmin))) (A max A mm ) , 

where C = 1 + A max /A m i n and ift(t) = t 2 (l — t 2 ). 
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Proof. Let v be a normalized eigenvector of M with eigenvalue A. From Eq. (6.4 1, 
we obtain that 

(v,M'v) = X- X * U {U ^ +(u,v) 2 {u,M U ) 

= A + cos 2 a sin 2 a (1 + - — — ] ((w, Mw) — A) . 

V ( n > Mu ) J 

In the second step, we have expanded u = cos av + sin a w, where w is a unit 
vector orthogonal to v, and then collected terms. We apply this identity to v max and 
use that (u, Mu) > X mm and (w, Mw) < A max to obtain 

A' max > ("max, M'v maiX ) 



> A max - ( 1 + ) ip({u,v max )) (A 

Ami 



max /v mm ) 



Similarly, 



A m in - (Vmm,M'v m - m ) 

< A min + 2lp((u, Vmin)) (A max A r 

The claim follows by subtracting the two inequalities. □ 



>mm ) 



Lemma 6.5 (Expected change of extremal eigenvalues). Let f be given by Eq. {6.2\ 
with a positive definite symmetric matrix M whose extremal eigenvalues satisfy 
Amax < 2A nmi . If U is a uniformly distributed random variable on then 
Suf{ x ) = [1 — { x , M'x)] + , where M' is a positive definite symmetric matrix whose 
the extremal eigenvalues satisfy 

-^(Amax — A m j n ) > — (A max — A m i n ) . 

Proof. We apply Lemma|64]and take expectations. Let v ma x and Vmin be the eigen- 
vectors of M corresponding to A max and A m i n , and set C = 1 + A max /A m i n < 3 
and ip(t) = t 2 (l — t 2 ). By taking advantage of the rotation invaiiance, we com- 
pute E((U,v} 2 ) = 1/d and E((U,v) A ) = 3/(d(d + 2)) for all v G S*" 1 , see EH 
Exercise 63, p. 80]. This results in 

E(l - Cl/>({U, Vmax) - 2^((C7, v min ))) = 1 - (C + 2) ( \ 



d d(d + 2) t 

The claim follows by evaluating the right hand side at d = 3, where it assumes its 



minimum value, and using Eq. ( 6.5 ). □ 



Proof of Proposition \6.2\ Let / be given by Eq. (6.2) with some positive definite 
symmetric matrix M . We first consider the case of a random sequence F n = 
Su 1 ...u n f> where t he di rections {Ui} are independent and uniformly distributed on 



;<f-l 



By Lemma 6.3 we can write F n in the form (6.2i with a positive definite 



symmetric matrix M n that is recursively defined by Eq. (6.4i with u = U n . We 



iterate the estimate in Lemma |6.5| using the Markov property, and obtain that the 
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gap between the extremal eigenvalues of M n is at least (A max — A m i n )3 n . Since 
we assumed that A max < 2A m i n , it follows from Eq. (|6.3|) that 



Tp/\\Q f \ \ A max A m i n 

H\\ b U 1 ...U n J-j\\oo) > " 

•^A max 

> \\\f - r\\oo3- n . 

For the second claim, we proceed as in the proof of Proposition [6j] by realizing 
an arbitrary sequence as a subsequence of one for which convergence fails. Given 
{ui}i>i in S d ~\ let e > so small that (C + 2) sin 2 e < 1, where C = 1 + 



Amax/Amin as in Lemma 6.5 and construct the sequence {vi}i>\ as follows. In 
the first step, pick v\ to be a maximizing eigenvector of M. Suppose we have 
already chosen v±, . . . ,v n such that d(vi, Uj+i) < e/i for each i < n, and that 
ui, . . . ,Uj appear as a subsequence. If d(v n , Uj + \) < e/n, pick = Uj+i. 
Otherwise, choose v n+ \ on the great circle that joints v n with Uj+i in such a way 
that d(u n ,u n+1 ) = e/n and = d(v n ,«j + i) - e/n. Since J] e/n 

diverges, the entire sequence {m} is incorporated as a subsequence into {i>i}. If 
{ui} is dense, so is {i>i}. 

Let f n = S Vl ,,, Vn f = [1 — (x, M„x)] + . If / n converges to some limit g, then g 
is given by Eq. ( |6.2| ) with some positive definite symmetric matrix iV of the same 
determinant as M. Since {vi} is dense in S d_1 , we find that N is necessarily a 
multiple of the identity because g is invariant under every Steiner symmetrization. 
On the other hand, we can estimate the extremal eigenvalues of N as follows. By 
construction, v n is an eigenvector of M n . Since d(v n , v n+ \) < e/n and the other 
eigenvectors of M n are orthogonal to v n , we have that tp((v n+ i, v)) < sin 2 (e/n) 
for each eigenvector v of M n . Iterating Lemma [6\4) we see that the gap between the 
extremal eigenvalues of N is at least (A max — A m i n ) Y\(l — (C+ 2) sin 2 (e/n)) > 0, 
a contradiction. □ 



7. Compact sets 

We finally collect the implication of our results for compact sets. Under the 



assumptions of Theorems 2. 1 and 2.2 random polarizations of functions in L+ also 
converge almost surely in L p , see Eq. (12. 3b. In particular, for p = 1, 



(7.1) P lim m(S Wl ... Wn A A A*) = MA C X with m(A) < oo = 1 . 



This is another equivalent restatement of Eq. (2.2i. We now establish the corre- 
sponding convergence result for the Hausdorff distance. 

The topology defined by the Hausdorff metric on the space of compact sets is not 
comparable to the topology of symmetric difference. Moreover, polarization is not 
continuous with respect to Hausdorff distance. To give a simple example, consider a 
reflection o that does not fix the origin, and let a be the image of the origin under a. 
By definition, S({o, a}) = {o, a}. Let {aj}j>i be a sequence in X with ^ a for 
all i that converges to a. The sequence of two-point sets {o, a -/] clearly converges 
to {o, a}. Since the reflected sequence {aai} converges to the origin, we have that 
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d(cii,o) > d(aai,o) and therefore m G -ff - for i large enough. It follows that 
S({o, aj}) = {o, aai}, which converges in Hausdorff distance to {o}. 

Nevertheless, convergence of a sequence of polarizations in Hausdorff distance 
to a ball implies convergence in symmetric difference. To see this, let K be a 
compact set of positive volume, and consider a sequence K n = S Ul ,. Mn K. If 
K n converges to (the closure of) K* in Hausdorff distance, then the radius of the 
smallest centered ball containing K converges to the radius of K* , which implies 
that m(K n \ K*) converges to zero. Since K n and K* have the same volume, 
m(K* \ K n ) goes to zero as well. 

In the other direction, we can obtain convergence in Hausdorff distance from the 
uniform convergence statement in Eq. ( |2.2| ) by realizing a given compact set as a 
level set of a continuous function. 

Proposition 7.1 (Convergence in Hausdorff distance). If a random sequence {Wi} 



satisfies the assumptions of either Theorem 2.1 or Theorem \2.2\ then 



P ( lim d H (Sw- l ...W n K,K*) = V compact K C Xwithm(K) > ) = 1 



\n— >oo 

and 



p(l™d H (dS Wl ... Wn K,dK*)=0 lZZ(K) K > C Oa l ndrn(dK) = o) =1 - 

Proof. Set K n = Sw 1 ...w n K . We consider the two pieces of the Hausdorff distance 
from K n to K* separately. If dist(x,iv~ n ) = 5 > for some x G K*, then 
m(K n A K*) > 2m(B s (x) n K*) > 0. Therefore Eq. {7j} implies that 

sup dist (x, K n ) — > (n — > oo) almost surely 

xeK* 

simultaneously for all K. 

To control the other piece of dn(K n , K*), we use the auxiliary function f(x) = 
[1 — dist(x, K)] + . By definition, the level set of / at height 1 — t is the outer parallel 
set {x : dist(x, K) < t}. The level set of /* at that height is the centered ball of 
the same volume. Its radius p(t), defined by 

B p[t) = {x : dist(x, K) < t}* (t > 0) 

depends continuously on t and converges to the radius of K* as t — > 0. Set F n = 
Sw 1 ...w n f- Since F n (x) = 1 for all x G K n , 

sup dist (x, K*) = sup p(F n (x) — f*(x)) — radius (K*) 

xeK n x£K„\K* 

(7.2) —^0 (n — > oo) almost surely 



by Theorem 2. 1 This proves the first claim. 

If dK has zero volume, we continuously extend the function p such that 

B p(0) =K\ B p{t) = {x : dist(x,X\ K) > -t}* (t < 0) , 

and replace the auxiliary function with 

(7.3) f{x) = [h + dist(x, X\K)- dist(x, K)] + , 
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where h > is an arbitrary constant. The level sets of / at heights below h are 
outer parallel sets of K, while the level sets at heights above h are inner parallel 
sets. It follows that 

d H {dK n ,dK*) = sup \p(h- f*(x)) -radius (K*)\ 
xedK„ 

(V.4) < m|x|p(±||F n -/*lloo)-p(0)| 

— ^ (n — > oo) almost surely. 
In the second line, we have used that F n = h on dK n . The last line follows from 



Theorem 2.1 and the continuity of p. □ 



Similar arguments can be used to bound the rate of convergence for sets with 
additional regularity properties. Let K be a compact set in M. d , and define / and p 
as in the proof of the second claim of Proposition |7.1| Assume that K C Bl, and 



that p is differentiable at t = with p'(0) = Per (K) /Per (K*). By Proposition^ 
there exists a sequence {uji] such that 



i 



WS^J-nU < 10(L + /i)n-3+r . 
Expanding p about £ = 0, we obtain from Eq. ( |7.4[ ) that 

d H (dK n ,dK*) < P '(o) (i + (i)) H^...^/ - r Hoc 

Per CJlI i 
(7.5) < C • radius (K*) • — ^ (1 + o(l)) n" ^ 

asn-> cxd, where C = 10(L + h)/p(0). After dropping an initial segment n < N 
from the sequence, we may replace L with the radius of the smallest centered ball 
containing K^. Choosing N sufficiently large and h sufficiently small, we can find 
a sequence of polarizations where Eq. (|7.5|) holds with C = 10. 



Remark. The conclusions of Proposition 7.1 also hold for random Steiner sym- 
metrizations that satisfy the assumptions of Corollary 2.3 Likewise, Eq. ((73) 
applies to sequences of Steiner symmetrizations along i.i.d. uniformly distributed 
directions. However, in view of Klartag's result for convex sets, we expect such 
sequences to converge more rapidly (see Eq. (|3.2|>). 
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